CDS

Accession Number TCMCG042C33321
gbkey CDS
Protein Id XP_016465457.1
Location complement(join(898..1212,1292..1431,1508..1647,1989..2074,2150..2295,2384..2456,3551..3686,3892..3938,4794..4839,5607..5719,5806..5841,5939..6142,8775..8840))
Gene LOC107788296
GeneID 107788296
Organism Nicotiana tabacum

Protein

Length 515aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA319578
db_source XM_016609971.1
Definition PREDICTED: DNA mismatch repair protein MSH4-like isoform X3 [Nicotiana tabacum]

EGGNOG-MAPPER Annotation

COG_category L
Description MutS family domain IV
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko03400        [VIEW IN KEGG]
KEGG_ko ko:K08740        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGAAGACGACACCGGAGAACGATCAAGCTTCGTCATCGGCCTCATCGAGAATCGCGCCAAGGAGGTTGGAGTAGCTGCATTTGACTTGAGATCAGCTTCACTGCATCTTTCTCAGTATATAGAGACTAGCAGCTCCTATCAAAATACAAAGACTTTACTACAGTTCTATGAACCTATGGCGATCATTGTTCCTCCAAATAAACTGGCAGCAGATGGAATGGTCGGAGTCTCACAGCTAGCTGATGGATTTTGTTCCTCCACAAGAAAGGTTATAATGAATCGTGGTTGCTTTGATGATACCAGGGGAGCTGTACTGGTTAAAGGATTAGCTGCTAAGGAACCGTCTGCTCTTGGTTTGGACTCATACTACAAGCAATATTATTTGTGTCTGGCTGCAGCAGCGGCAACTATCAAGTGGATTGAAGCAGAGAAAGGTGTTATTGTGGTAAATCACTCTTTGCTGGTTACCTTCAATGGATCTTTTGACCACATGAACATTGATGCCACCAGTGTTCAGAACTTGGAGATAATTGAGCCTATGCACTCTTCTCTTTTGGGCACAAACAGCAAAAAGAGAAGTTTATTCCACATGCTTAAAACAACTCGAACTATTGGCGGGTATGGAAATAATTGCATTGATAAGCTGGATGAGTTGATGAGCAATGAGCAGCTATTCTTTGGCTTGTCTCAGGCCCTTCGTAAGTTTCCGAAAGAAACAGATAGGGTCCTTTGTCACTTCTGCTTCAAGCCAAAGAGAGTTACTAATGAAGTCTTGGCTTCAGATAATGGAAGGAGGAACCAAATTATGATATCCAGCATTATTCTTCTCAAAACCGCTCTTGATGCTTTACCGTTACTCTCCCAGGTGCTTAAAGAAGCCAAGAGTTGTCTGCTGGGAAATGTTTACAAGTCCATATGTGAGAATGAAAAATATACTTCAATCAGGAACAGAATTGGAGAAGTGATTGATGAAGATGTCCTTCATACACGAGTTCCTTTTGTTGCACGGACACAGCAGTGTTTTGCTCTTAAGGCTGGAGTTGATGGGCTTCTTGATATGGCCCGTAGATCATTTTGTGACACCAGCGAAGCTATATACGACCTAGCAAATAAGTATCGTGAAGATTTCAGACTGCCAAACTTGAAGATCCCATTCAATAACAGGAAAGGGTTTTACTTTAGCATTCCGCAAAAGGACATACAGGGAAAACTACCCAGCAAGTTCATCCAGGTCATGAAACATGGAAACAATGTCCATTGCTCCAGTCTTGAACTTGCTTCAGTGAGTATTACCTGGCATCCATTGCTACATTCCTCTGTAGGCTCCATTTGGTTTGGTGCAAAACATATTCCAGAAGATATTGCTCTTGCAAATTTTCATGAGGAAAACCCGGCAAATCAACTTGAAGCAATAACTCTTAAGGGCGGGAGTGAATGCACCATTGGGGATGGGGCACGAAGGCTAAAAGAAAAAGTAGTGGATCAGCGTAAAATGTTTTCAATATGCAATAATGTGAAAGGGAAGTGTGGGAATACTGTTTGGTGA
Protein:  
MEDDTGERSSFVIGLIENRAKEVGVAAFDLRSASLHLSQYIETSSSYQNTKTLLQFYEPMAIIVPPNKLAADGMVGVSQLADGFCSSTRKVIMNRGCFDDTRGAVLVKGLAAKEPSALGLDSYYKQYYLCLAAAAATIKWIEAEKGVIVVNHSLLVTFNGSFDHMNIDATSVQNLEIIEPMHSSLLGTNSKKRSLFHMLKTTRTIGGYGNNCIDKLDELMSNEQLFFGLSQALRKFPKETDRVLCHFCFKPKRVTNEVLASDNGRRNQIMISSIILLKTALDALPLLSQVLKEAKSCLLGNVYKSICENEKYTSIRNRIGEVIDEDVLHTRVPFVARTQQCFALKAGVDGLLDMARRSFCDTSEAIYDLANKYREDFRLPNLKIPFNNRKGFYFSIPQKDIQGKLPSKFIQVMKHGNNVHCSSLELASVSITWHPLLHSSVGSIWFGAKHIPEDIALANFHEENPANQLEAITLKGGSECTIGDGARRLKEKVVDQRKMFSICNNVKGKCGNTVW